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© In a memory system having a main memory and 
a faster cache memory, a cache memory replace- 
ment scheme with a locking feature is provided. 
Locking bits associated with each line in the cache 
are supplied in the tag table. These locking bits -are 
preferably set and reset by the application 
program/process executing and are utilized in con- 
junction with cache replacement bits by the cache 
controller to determine the lines in the cache to 
replace. The locking bits "lock" the line of data in 
the cache until such time when the process resets 
the lock bit By providing that the process controls 
the state of the lock bits, the intelligence and knowl- 
edge the process contains regarding the frequency 
of use of certain memory locations can be utilized to 
provide a more efficient cache. 
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BACKGROUND OF THE INVENTION 
1 . Field of the Invention: 



This invention relates to the field of computer 
cache memory devices. More particularly, the 
present invention relates to a method and appara- 
tus for "locking" data into the cache memory such 
that a program can designate pages or blocks of 
memory which should remain in the cache. 

2. Art Background: 

A simple way to increase the throughput of a 
computer processor is to increase the frequency of 
the clock driving the processor. However, when the 
processor clock frequency is increased, the pro- 
cessor may begin to exceed the speed at which 
the main memory can respond to the processor's 
requests. The processor may therefore be forced 
to wait for the main memory to respond. In order to 
alleviate this main memory latency period, cache 
memory was created. 

Cache memory refers to a small amount of 
high-speed memory that is coupled closely to the 
processor. The cache memory is used to duplicate 
a subset of main memory locations. When a pro- 
cessor needs data from memory, it will first look 
into the high-speed cache memory. If the data is 
found in the cache memory (known as a "hit"), the 
data will be retrieved from the cache memory and 
execution will resume. If the data is not found in 
the cache memory (known as a "miss") then the 
processor will proceed to look into the slower main 
memory. 

For example, if a particular program will refer 
to a particular data table in the main memory often, 
it would be desirable to place a copy of the data 
table into a high-speed cache memory. If a copy of 
the data table is kept in the cache memory, then 
each time the processor needs data from the data 
table it -will be retrieved quickly. 

Cache memories usually store only a small 
subset of the main memory. When every location 
in the cache memory is filled, the cache memory 
must discard some of the data from what is cur- 
rently in store. Determining which memory cache 
locations to discard is a difficult task since it is 
often not known which cache memory locations wtll" 
be needed in the future. Various heuristics have 
been developed to aid in determining which main 
memory locations will be duplicated in the high- 
speed cache memory. 

Referring to Figure 1, a high level block dia- 
gram of a prior art cache memory system is 
shown. The main memory 10, cache memory sys- 
tem 12 and processor 14 are coupled in a bus 16. 
The processor issues memory requests to the 



cache memory system 12. If the information is 
available in the cache memory 15 the information 
requested is immediately forwarded to processor 
1 4 via a dedicated line 1 8. If the information is not 

5 located in the cache memory 15, the request is 
forwarded to the slower main memory 10, which 
provides the information requested to processor 14 
via the bus 16. 

There are many methods of mapping physical 

10 main memory addresses into the cache memory 
locations. Among these methods are: Fully associ- 
ative, Direct Mapped, and Set Associative. In a fully 
associative cache system, any block of main mem- 
ory can be represented in any cache memory line. 

75 In a direct mapped system, each block of main 
memory can be represented in only one particular 
cache memory location. In a set associative sys- 
tem, each block of main memory can only be 
placed into cache memory lines having the same 

20 set number. For more information on cache mem- 
ory mapping systems, please refer to Hennessy, 
Patterson, Computer Architecture: A Quantitative 
Approach, Morgan Kaufman Press, 1 990, page 408 
^~410 

25 In order to control the operation of the cache 

memory, there is dedicated control logic referred to 
as the cache controller (17, Figure 1). A table is 
located within the cache controller. The TAG table 
is used for storing information used for mapping 

30 main memory physical addresses into a cache 
memory set and line address. In particular, the 
TAG table stores block address and related control 
bits for each cache memory line. The block ad- 
dress refers to the physical main memory block 

35 address that is currently represented in the cache 
memory line. The control bits store information 
such as whether or not the cache memory line has 
valid data. In addition, the table stores data utilized 
to implement a cache replacement algorithm. The 

40 data table is divided to match the organization of 
the cache memory. 

When all the lines in a cache memory set 
become full and a new block of memory needs to 
be placed into the cache memory, the cache con- 

45 troller must discard the contents of part of the 
cache memory and replace it with the new data 
from main memory. Preferably, the contents of the 
cache memory line discarded will not be needed in 
the near future. However, the cache controller can 

so only predict which cache memory line should be 
discarded. As briefly noted earlier, in order to pre- 
dict as efficiently as possible, several cache re- 
placement heuristics have been developed. The 
presently used cache replacement heuristics in- 

55 elude Round- Robin, Random, Least-Recently-Used 
(LRU), and Pseudo-Least-Recently-Used. These 
heuristics determine which cache memory location 
to replace by looking only at the cache memory's 
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past performance. 

The Round-Robin replacement heuristic simply 
replaces the cache memory lines in a sequential 
order. When the last cache memory line is 
reached, then the controller starts back at the first 
cache memory line. 

The Least-Recently-Used (LRU) replacement 
scheme requires more intelligence at the cache 
controller. In the LRU heuristic, the assumption is 
that when a cache memory line has been accessed 
recently, it will most likely be accessed again in the 
near future. Based upon this assumption, then the 
cache memory line that that has been "least re- 
cently used" should be replaced by the cache 
controller. To implement the LRU heuristic, the 
cache controller must mark each cache memory 
line with a time counter each time there is a "hit" 
on that cache memory line. When the cache con- 
troller is forced to replace a cache memory line, 
the cache controller replaces the cache memory 
line with the oldest time counter value. In this 
manner the cache memory line which was "least 
recently used" will be replaced. 

Although the LRU heuristic is relatively effi- 
cient, it does have drawbacks. One problem with 
the LRU replacement scheme is that it wastes 
valuable high-speed cache memory. Each time a 
cache hit occurs, the cache controller must place a 
time counter value in memory location associated 
with the cache memory line. Another problem with 
the LRU replacement scheme is that it requires 
complex logic to implement. When a replacement 
must occur, the cache controller must compare all 
the cache memory line time counter values, this 
procedure wastes valuable time. When these fac- 
tors are accounted for, the LRU scheme loses 
some of its efficiency. 

The Pseudo-Least-Recently-Used (PLRU) re- 
placement scheme is somewhat similar to the LRU 
replacement scheme except that it requires less 
complex logic and does not require much high- 
speed cache memory to implement. However, 
since the PLRU scheme employs shortcuts to 
speed up operation, the least recently accessed 
cache memory location is not always the location 
replaced. In the PLRU replacement scheme each 
cache memory line is assigned an MRU (or Most- 
Recently-Used) bit which is stored in the TAG 
table. The MRU bit for each cache memory line is 
set to a "1 " each time a "hit" occurs on the cache 
memory line. Thus, a "1 " in the MRU bit indicates 
that the cache memory line has been used re- 
cently. When the cache controller is forced to re- 
place a cache memory line, the cache controller 
examines the MRU bits for each cache memory 
line looking for a "Cr. If the MRU bit for a particular 
cache memory line is set to a "1", then the cache 
controller does not replace that cache memory line 



since it was used recently. When the cache con- 
troller finds a memory line with the MRU bit set to 
"0", that memory line is replaced and the MRU bit 
associated with the cache memory line is then set 
5 to "1". 

A problem could occur if the MRU bits for all 
the cache memory lines are set to "1". If this 
happened, all of the lines are unavailable for re- 
placement thus causing a deadlock. To prevent this 

io type of deadlock, all the MRU bits in the TAG are 
cleared except for the MRU bit being accessed 
when a potential overflow situation is detected. If 
the cache is set-associative, all the MRU bits in the 
TAG array for the set are cleared, except for the 

75 MRU bit being accessed, when a potential overflow 
situation is detected because all of the MRU bits 
for the set are set to "1 n . 

The PLRU scheme is best explained by the 
use of an example. Referring to Figure 2, an 

20 example of the PLRU replacement scheme is illus- 
trated in a cache environment with 4 cache tines 
available. At step 1, all the MRU bits are cleared 
indicating that none of the cache lines have been 
used recently and all the cache lines are free for 

25 replacement. At step 2, a cache hit occurs on the 
data in line 3. The cache controller causes the 
MRU bit for line 3 to be set to "1 ", indicating that 
the data in line 3 has been used recently. Cache 
fines 0, 1, and 2 are still available. At step 3, a 

30 cache hit occurs on the data in line 1. The cache 
controller causes the MRU bit for line 1 to be set to 
"1 n , indicating that the data in line 1 has been used 
recently. At step 4, a cache hit occurs on the data 
in line 0. The cache controller similarly causes the 

35 MRU bit for line 0 to be set to "1", indicating that 
the data in line 0 has been used recently. Now, 
only Cache line 2 has not been marked as being 
used recently. At step 5, a cache hit occurs on the 
data in line 2. If the MRU bit for line 2 is set to a 

40 "1", all the MRU bits would be set to "1 " (1111) 
and no cache lines would be available for replace- 
ment. This would be a case of cache deadlock. 
Instead, the cache controller causes all of the MRU 
bits to be cleared and sets the MRU bit for line 2 to 

45 a "V r . Now lines 0, 1, and 3 are available for 
replacement. The act of clearing of all the MRU 
bits results in the loss of some cache history, but is 
required in order to avoid cache deadlock. The 
cache operations then continue as before. 

so However, these heuristics can be improved if 

some information is known about the cache mem- 
ory's future usage. For example, if it is known that 
a certain cache memory location will be used in the 
near future, it would be best not replace that cache 

55 memory location. In, the example given earlier, it 
was known that the program would access the data 
in the data table repeatedly. If the data table was 
placed into the cache memory in that case, it 
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would be advantageous to be able to "lock" that 
cache memory location so that it could not be 
replaced. If this was done, then each time the 
program subsequently needed information from the 
data table it would always be found in the cache 
memory. Therefore, the data in the data table 
would always be quickly fetched from the cache 
memory instead of having to be retrieved from the 
slower main memory. 

SUMMARY OF THE INVENTION 

It is therefore the object of the present inven- 
tion to provide an efficient method for replacing 
cache memory locations when the cache memory 
becomes full. 

It is a further object of the present invention to 
provide a method and apparatus for allowing pro- 
grams to lock certain cache memory locations into 
the cache memory so they will not be replaced. 

It is a further object of the present invention to 
prevent a user from causing "deadlock" of the 
cache memory by not allowing the user to lock all 
the cache memory locations. 

These and other objects are accomplished by 
the unique method and apparatus of the present 
invention. The method and apparatus of the 
present invention comprises a cache memory re- 
placement scheme which utilizes locking bits. 
These locking bits are preferably set and reset by 
the application program/process executing and are 
utilized in conjunction with cache replacement bits 
by the cache controller to determine the lines in 
the cache to replace. The locking bits "lock" the 
line of data in the cache until such time when the 
process resets the lock bit. By providing that the 
process controls the state of the lock bits, the 
intelligence and knowledge the process contains 
regarding the frequency of use of certain memory 
locations can be utilized to provide a more efficient 
cache. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The objects, features, and advantages of the 
present invention will be apparent to one skilled in 
the art from the following detailed description in 
which: 

FIGURE 1 is a high-level block diagram of " a T ' 
typical prior art cache memory system. 

FIGURE 2 illustrates an exemplary prior art 
pseudo-ieast-recently-used replacement process. 

FIGURE 3 illustrates a prior art set associative 
cache. 

FIGURES 4a, 4b and 4c illustrate a preferred 
embodiment of the cache system of the present 
invention and the locking bits employed. 



FIGURE 5 illustrates the STAG and PTAG ta- 
bles utilized in the preferred embodiment of the 
cache system of the present invention. 

FIGURE 6 illustrates a pseudo-least-recently- 
s used replacement process employing locking bits. 

DETAILED DESCRIPTION OF THE INVENTION 

A cache which implements a least recently 

10 used replacement algorithm is provided with the 
ability to lock certain memory locations in the 
cache. If a memory location in the cache is locked, 
the information contained therein remains in the 
cache until the lock is removed and the cache 

75 replacement algorithm determines that the line of 
the cache should be replaced. 

The tag table is provided with an additional bit, 
a lock bit, which is associated with each line of 
cache memory. Preferably this bit can be set by 

20 the process accessing that particular cache mem- 
ory location. The advantage is the added intel- 
ligence and pre-existing knowledge provided by 
the application program or process accessing the 
cache. The application program has pre-existing 

25 knowledge as to the frequency of access of certain 
variables or memory during execution of the pro- 
gram. This is not readily apparent to the cache 
controller implementing the replacement algorithm. 
Thus, increased intelligence is provided to the 

30 cache replacement algorithm without unduly in- 
creasing the complexity of the cache controller or 
cache replacement algorithm. 

In the following description, for purposes of 
explanation, specific nomenclature is set forth to 

35 provide a thorough understanding of the present 
invention. However, it will be apparent to one 
skilled in the art that these specific details are not 
required in order to practice the present invention. 
In other instances, well known circuits and devices 

40 are shown in block diagram form in order not to 
obscure the present invention unnecessarily. In 
particular, the present invention has been imple- 
mented using the set associative mapping system 
and a pseudo-least-recently-used replacement al- 

45 gorithm. However, as is apparent to one skilled in 
the art, the cache system of the present invention 
is not limited to cache memory systems with set 
associative mapping or to the pseudo-least- 
recently-used replacement algorithm. 

so Referring to Figure 3, a block diagram of a set 

exemplary of associative cache memory is shown. 
In the exemplary set associative cache memory 
system illustrated there are 64 cache memory 
"sets", each set is given a label from 0 to 63. Each 

55 set in the cache memory contains 4 "lines" of 
cache memory. Each line of cache memory in 
each set is given a label 0 through 3. Each cache 
memory line is capable of storing an entire "block" 
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of main memory. 

Like the cache memory, the main memory is 
also divided into a number of sets. The number of 
sets. that the main memory is divided into is equal 
to the number of sets in the cache memory. For 
example, as shown in Figure 3, the main memory 
is divided into 64 sets. The main memory is di- 
vided up according to the high order bits of the 
block address. Thus the first n blocks belong to set 
0, the next n blocks belong to set 1, and so on. It is 
apparent that the sets could just as easily be 
divided using the low order bits of the block ad- 
dress such that all block addresses which end in 0 
belong in set 0, and all block addresses which end 
in 1 belong to set 1. For example, set 0 encom- 
passes blocks 0, N, 2N ... 61 N, 62N, 63N; and set 
1 encompasses blocks 1, N + 1, 2N + 1 ... 61N + 1, 
62N + 1 , 63N + 1 . 

The main memory sets are considerably larger 
than the cache memory sets. Each set of main 
memory is then further divided into a number of 
memory blocks. Each block of main memory can 
only by duplicated in the cache memory having the 
same set number. For example, block 3 in set 0 
can only be duplicated in set 0 of the cache 
memory and block n + 1 in set 1 can only be 
duplicated in set 1 of cache memory. 

As previously mentioned, each set of cache 
memory is made up of a number of "lines" of 
cache memory. The "lines" of cache memory are 
equal in size to the "blocks" of main memory and 
are used for storing duplicates of main memory 
blocks. Essentially, cache memory lines and main 
memory blocks are the same, except that "lines" 
only exist in the cache memory and blocks only 
exist in the main memory. 

The locking mechanism for the cache system 
of the present invention may be conceptually de- 
scribed with reference to Figures 4a-4c. Figure 
4a shows cache 300 which contains the memory 
contents of addresses most recently accessed. 
Cache controller 310 controls the access to the 
cache 300 and implements the cache replacement 
algorithm to update the cache. The tag table 315 
contains information regarding the memory or tag 
address of the data contained in the cache as well 
as control bits. Referring to Figure 4b, an illustra- 
tive entry in the tag table is shown. One tag table 
entry is provided for each line in the cache. Tn 
addition to the address 325 and control bits 330, 
each entry is provided with a bit MRU 335 which is 
set when the cache at that particular line is acces- 
sed. This is utilized in the replacement algorithm 
implemented by the cache controller. In addition, a 
lock bit 340 is provided to prevent the line in the 
cache from being replaced. This lock bit is settable 
by the processor program accessing the cache and 
is similarly, resettable by that program when re- 



peated access to that information is no longer 
required and the line in the cache can be replaced. 
In implementation, the concept may be visualized 
by reference to Figure 4c. When the cache con- 

s troller is required to replace a line in the cache, the 
cache controller accesses the tag table to read the 
MRU and lock data. Thus, the lock data and MRU 
data may be logically ORed together to result in 
the replace bit indicative of whether that particular 

jo line in the cache can be replaced. This logical OR 
function may be performed by the cache controller 
itself or by external logic. The OR function result is 
known as the Composite Mask, rf the resultant 
replace bit in the Composite Mask is set, the line in 

15 the cache is not removed for replacement by a 
different memory location. Thus, regardless of the 
value of the MRU bit, the lock bit can be set to 
ensure that the data is maintained in the cache. 
Preferably, the tag table is implemented as two 

20 separate tag tables as shown in Figure 5. The first 
table PTAG 400 comprises the address information 
and control bits. The address refers to the physical 
main memory block address that is currently repre- 
sented in the cache memory line. Control bits 

25 include a valid bit which indicates rf the cache 
memory line contains valid data. In addition, a 
second table STAG 410 is provided. The STAG 
contains the MRU bits and the lock bits for each 
line of cache memory. As noted earlier, the MRU 

30 bit is used for implementing a Pseudo-Least- 
Recently-Used replacement scheme. 

The cache controller monitors the state of the 
composite mask to ensure that the composite mask 
never reaches the state where all the composite 

35 bits for all lines are set and cache deadlock occurs. 
In addition, to prevent all the cache memory lines 
from being locked by the user, it is preferred that a 
mechanism is provided to monitor the number of 
lock bits set and to inhibit additional lock requests 

40 by an application program if a predetermined num- 
ber of lock bits are set. The mechanism may be 
provided in the cache controller, the 
program/process or compiler. Alternately, to avoid 
the locking of all cache lines, it is preferred that 

45 cache' memory line 0 is controlled such that the 
lock bit is never set. This provides a simple low 
overhead solution to the problem and avoids dead- 
locks due to programmer errors. 

Referring to Figure 6, a sample use of the 

so replacement scheme of the present invention is 
given. At the initial starting point in step 1, all the 
MRU bits and lock bits are cleared. In step 2, a 
cache hit occurs on the data in line 3. The cache 
controller causes the MRU bit for cache memory 

55 line 3 to be set to "1", indicating that the data in 
line 3 has been used recently. Cache lines 0, 1, 
and 2 are still available. Next in step 3, the user 
program locks the data located at line 2. The cache 
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controller then sets the lock bit for cache memory 
line 2 to "1", indicating .that the data in line 2 is 
now locked into the cache. The composite mask, 
created by the logical "OFT of the MRU bits and 
the lock bits is "1100", indicate that cache lines 0 
and 1 are still available. In step 4, a hit occurs on 
the data in line 2. The cache controller causes the 
MRU bit for cache memory line 2 to be set to "1", 
indicating that the data in line 2 has been used 
recently. This composite mask remains "1100", 
indicating that cache lines 0 and 1 are still avail- 
able. In step 5, a hit occurs on the data located at 
line 0. The cache controller causes the MRU bit for 
cache memory line 0 to be set to T, indicating 
that the data in line 0 has been used recently. The 
resultant composite mask is "1101" indicating that 
only line 1 remains available for replacement. 

In step 6, a hit occurs on the data in line 1. If 
the cache controller causes the MRU bit to be set 
to "1", the composite mask would be "1111". 
Instead, the cache controller causes the MRU bits 
to be reset and the MRU bit for cache memory line 
1 to be set to "1 ". indicating that the data in line 1 
has been used recently. The resultant composite 
mask is now "0110" as the lock bit for line 2 
remains set. In step 7, the user program executes 
an instruction to lock the data in line 3. The cache 
controller carries out this instruction by causing the 
lock bit for line 3 to be set to "1". In step 8, a 
cache hit occurs on line 0. Again, the cache con- 
troller must clear the MRU bits to prevent a com- 
posite mask of "1111 w from occurring. In step 9, 
the user locks cache memory line 1. Now all the 
cache memory lines that can be locked are locked. 
To prevent the cache memory from being deadloc- 
ked, the system clears the MRU bits. Only cache 
memory line 0 is available for replacement when all 
the other lines are locked. In step 1 0, a hit occurs 
on line 0. The MRU bit for line 0 is not set by the 
cache controller since this would cause the com- 
posite mask to become "1111" causing cache 
memory deadlock. 

In step 11, a cache hit occurs on line 1. The 
MRU bit for line 1 is set to "1 " indicating that it has 
been used recently. Still, only cache memory line 0 
is available. In step 12, the user finally unlocks the 
cache memory line 2 by unlocking line 2. The 
composite mask now becomes W 1010", indicating 
that lines 0 and 2 are now available for. replace- 
ment. In step 13, when a hit occurs on line 0, the 
MRU bit for line 0 is set to "1". Unlike step 10, the 
setting of line 0*s MRU bit will now not cause 
deadlock because additional lines have been un- 
locked. 

As noted earlier, a distinctive advantage gained 
by utilizing the locking mechanism in the cache 
system of the present invention is the added intel- 
ligence provided to the cache replacement pro- 



cess. The lock bits are set by the application 
process thereby eliminating the intelligence re- 
quired to try to provide that knowledge at the 
cache controller level. One way to provide the 

5 request to lock certain cache memory lines is for 
the application program to program such request 
into the application program in the form of a pre- 
determined command or subroutine call. If the pro- 
grammer knows that certain variables or memory 

70 locations are to be accessed frequently during the 
execution of the program, after the first access, a 
special command may be issued to set the cor- 
responding lock bit The compiler compiling this 
program will recognize the command request and 

75 provide the proper code to execute the command. 

System Programs, such as operating system 
routines, some database or window system rou- 
tines may be used for controlling the locking as set 
forth in the present invention. Locking performed in 

20 the system programs boosts the performance of 
some key features used by application programs 
without any intervention from the application pro- 
grammer. For example, a programmer building a 
graphics package might use an efficient line draw- 

25 ing function provided by the operating system's 
graphics library. If this function were locked into 
the cache, the speed of execution of the graphics 
package can be indirectly increased. 

Preferably, the locking mechanism of the 

30 present invention has been provided for use 
through special assembly language instructions 
available for execution in supervisor mode only. A 
system call providing the lock and unlock line 
commands can easily be written to help a program- 

35 mer. This is a very powerful mechanism and 
should be used by a knowledgeable programmer 
only. For example, in the SPARC™ (SPARC is a 
trademark of SPARC International, Inc.) architecture 
a load/store instruction can be adapted to modify 

40 the lock bits. One way to adapt the load/store 
command is by reserving an ASI value to cor- 
respond to the location of the lock bits. When the 
CPU executes the instruction, the cache controller 
receives a command from the CPU to unlock/lock 

45 certain lock bits. The cache controller responds by 
issuing a command to set/reset specified lock bits 
in the tag array. For further information regarding 
the load/store instruction see, The SPARC Archi- 
tecture Manual, Version 8, pp. 45-49 (Prentiss Hall 

so 1992). 

Alternately, it is preferred that intelligent com- 
pilers are provided that perform an automated ana- 
lysis on the memory accesses to be performed to 
determine those memory accesses of high fre- 
55 quency which would benefit by having the cor- 
responding lock bit set. A command can then be 
automatically inserted into the compiled code to 
perform the locking and subsequently, the unloc- 
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king of the lock bits. This technique is advanta- 
geous as the decision- whether to lock certain ac- 
cesses in the cache is automatically determined by 
the compiler and would release the application 
programmer from making such a decision. 5 

Cache systems implementing the PLRU with 
locking feature as described above can exhibit sig- 
nificantly lower cache memory miss rates than 
ordinary PLRU cache systems. The gained effi- 
ciency is due to the "intelligence** added to the w 
cache replacement heuristic. The foregoing has 
described a method and apparatus for implement- 
ing a cache memory system with a pseudo-LRU 
replacement scheme with a locking feature. It is 
contemplated that changes and modifications may is 
be made by one of ordinary skill in the art. to the 
materials and arrangements of elements of the 
present invention without departing from the spirit 
and scope of the invention. 

20 

Claims 

1. In a computer system comprising master de- 
vices including a central processing unit 
(CPU), and a memory system comprising a 25 
main memory and a faster cache memory 
wherein a subset of the lines of main memory 

are stored in the cache memory for fast ac- 
cess by a master device issuing a request for 
access to memory, an apparatus for securing 30 
selected lines of main memory in cache mem- 
ory comprising: 

a tag table comprising tag bits and at least 
one lock bit for each line of the cache, said tag 
bits identifying the line of main memory lo- 35 
cated in cache wherein said tag bits are com- 
pared to determine if access to memory is 
provided by providing the line from the cache; 

means for controlling the state of the lock 
bit for each line of the cache located in the tag aq 
table; and 

replacement means for replacing a line of 
memory located in the cache with a different 
line of memory, said replacement means pro- 
hibited from replacing a line of memory in the 45 
cache if the corresponding lock bit in the tag 
table is set; 

wherein by setting the corresponding lock 
bit in the tag table, lines of memory located In 
the cache are secured in the cache regardless so 
of the cache replacement algorithm. 

2. The apparatus as set forth in claim 1 , wherein 
said tag table further comprises at least one 
replacement bit for each line in the cache, said 55 
replacement bit indicating usage of the line in 

the cache and is used by the replacement 
means to determine the line in the cache to 



replace. 

3. The apparatus as set forth in claim 2. wherein 
said replacement bit indicates recent usage of 
the line in the cache, said replacement means 
executing a least-recently-used (LRU) replace- 
ment algorithm to determine the line of the 
cache to replace. 

4. The apparatus as set forth in claim 2, wherein 
said replacement means logically ORs the re- 
placement bit and lock bit to generate a com- 
posite value used to determine the line of the 
cache to replace. 

5. The apparatus as set forth in claim 1, wherein 
said means for controlling the state of the lock 
bit bit further comprises a monitoring means to 
monitor the number of lock bits set in the tag 
array to prohibit all lock bits from being set. 

6. The apparatus as set forth in claim 1, wherein 
said means for controlling the state of the lock 
bit further comprises means for identifying at 
least one predetermined line of the cache as 
non-lockable such that deadlock is avoided. 

7. The apparatus as set forth in claim 1 , wherein 
said replacement means comprises a cache 
controller. 

8. The apparatus as set forth in claim 1, wherein 
said means for controlling the state of the lock 
bit comprises instructions issued by the master 
device. 

9. The apparatus as set forth in claim 8, wherein 
said means for controlling the state of the lock 
bit comprises instructions issued by an ap- 
plication process executing on the computer 
system. 

10. The apparatus as set forth in claim 9, wherein 
said instructions are processed by an operat- 
ing system of the computer system to issue 
commands to a cache controller to set the lock 
bit. 

11. The apparatus as set forth in claim 1, wherein 
said means for controlling the state of the lock 
bit- comprises a compiler to compile a program 
to generate compiled code to be executed, 
said compiler evaluating the program and in- 
serting commands to set/reset the lock bit in 
the compiled code. 

12. In a computer system comprising master de- 
vices including a central processing unit 
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(CPU), and a memory system comprising a 
main memory, a faster cache memory and a 
tag table associated with the cache memory, 
wherein a subset of the lines of main memory 
are stored in the cache memory for fast ac- 
cess by a master device issuing a request for 
access to memory, said tag table comprising 
tag bits, said tag bits identifying the line of 
main memory located in cache wherein said 
tag bits are compared to determine if access 
to memory is provided by providing the line 
from the cache, a method for securing se- 
lected lines of main memory in cache memory 
comprising the steps of: 

providing at least one lock bit associated 
with each line of the cache in the tag table; 

controlling the state of the lock bit for each 
line of the cache located in the tag table; and 

replacing a line of memory located in the 
cache with a different line of memory, such 
that lines of the cache having an associated 
lock bit which is set are not replaced; 

wherein by setting the corresponding lock 
bit in the tag table, lines of memory located in 
the cache are secured in the cache regardless 
of the cache replacement algorithm. 

13. The method as set forth in claim 12, wherein 
said tag table further comprises at least one 
replacement bit associated with each line of 
the cache, said step of replacing a line of the 
cache referencing the replacement bit and the 
lock bit to determine the line of the cache to 
replace. 

14. The method as set forth in claim 13. wherein 
said step of replacing logically ORs the re- 
placement bit and the lock bit to generate a 
composite mask used to determine the line of 
the cache to replace. 

15. The method as set forth in claim 13, wherein 
said step of replacing utilizes a least-recently- 
used algorithm to determine the line of the 
cache to replace. 

16. The method as set forth in claim 12, wherein 
the step of controlling the state of the lock bit 
comprises providing instructions to set/reset 7 
the lock bit in the application process execut- 
ing in the computer system. 

17. The methbd as set forth in claim 16, wherein 
the step of controlling the state of the lock bit 
further comprises issuing commands to 
set/reset the lock bit in the tag table when an 
instruction in the application program to 
set/reset the lock bit is executed. 



18. The method as set forth in claim 12, wherein 
the lock bits are set/reset during the execution 
of a program on the computer system, said 
method further comprising the step of compil- 

5 ing the program into compiled code to be 

executed, said step of compiling comprising: 
evaluating the program to determine when 

lock bits are to be set/reset for certain lines of 

memory placed in the cache; 
10 inserting commands to set/reset lock bits 

in the cache into the compiled code; and 

generating compiled code comprising the 

program and commands to set/reset the lock 

bits. 

75 

19. In a computer system comprising master de- 
vices including a central processing unit 
(CPU), and a memory system comprising a 
main memory and a faster cache memory 

so wherein a subset of the lines of main memory 

are stored in the cache memory for fast ac- 
cess by a master device issuing a request for 
access to memory, an apparatus for securing 
selected lines of main memory in cache mem- 

25 ory comprising: 

a tag table comprising tag bits, at least 
one replacement bit and at least one lock bit 
for each line of the cache, said tag bits iden- 
tifying the line of main memory located in 

30 cache wherein said tag bits are compared to 

determine if access to memory is provided by 
providing the line from the cache; 

an operating system for issuing commands 
to set/reset the lock bits located in the tag 

35 table; and 

a cache controller to control the contents 
of the cache and the tag table, said cache 
controller executing a replacement algorithm to 
replace a line of memory located in the cache 

40 with a different line of memory and updating 

the tag table, said cache controller prohibited 
from replacing a line of memory in the cache if 
the corresponding lock bit in the tag table is 
set; 

45 wherein by setting the corresponding lock 

bit in the tag table, lines of memory located in 
the cache are secured in the cache regardless 
of the cache replacement algorithm. 

so 20. The apparatus as set forth in claim 19, wherein 
said operating system communicates the com- 
mand to set/reset the lock bits to the cache 
controller which performs the setting/resetting 
of the lock bits. 
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